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Abstract 

^^ Let J^ be a family of graphs. A graph is J- -free if it contains no copy of a graph in J^ as a 

,_^ subgraph. A cornerstone of extremal graph theory is the study of the Turdn number ex(n, J^), 

^^ the maximum number of edges in an J^-free graph on n vertices. Define the Zarankiewicz number 

^^ z{n,T) to be the maximum number of edges in an J^-free bipartite graph on n vertices. Let Ck 

'^ denote a cycle of length k, and let Ck denote the set of cycles Ce, where 3 < i < k and i and 

K-5 k have the same parity. Erdos and Simonovits conjectured that for any family J-' consisting of 

^f) bipartite graphs there exists an odd integer k such that ex(n, J^UCk) ~ z(ri, J^). They proved this 

OQ when J^ = {C4} by showing that ex(n, {C4, C5}) ~ z(n, C4). In this paper, we extend this result 

^_^ by showing that if ^ € {2, 3, 5} and k > 2£ is odd, then ex(n, €21 U {Ck}) ~ z(ri, €21)- Furthermore, 

^^ if A: > 2£ + 2 is odd, then for infinitely many n we show that the extremal €21 U {Cfc}-free graphs 

r ^ are bipartite incidence graphs of generalized polygons. We observe that this exact result does not 

hold for any odd k < 2£, and furthermore the asymptotic result docs not hold when {£, k) is (3, 3), 
(5,3) or (5,5). Our proofs make use of pseudorandomness properties of nearly extremal graphs 
that are of independent interest. 
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1 Introduction 



Given a family F of graphs, a graph is F-free if it contains no copy of a graph in J-" as a subgraph. 
The Turdn number ex(n, T) is the maximum number of edges in an J^-free graph on n vertices. 
When T = {F} consists of a single forbidden graph we denote the Turan number by ex{n,F). A 
^^ classical theorem of Turan |39j gives an exact result for ex{n,Kt), where Kt is the complete graph 

on t vertices: the unique largest i^^-free graph on n vertices is the complete {t — l)-partite graph 
with part sizes as equal as possible. In general, Erdos, Stone and Simonovits [14^ [T5] showed that 
ex(n, F) = (1 — 1/r) (2) +o{n'^), where r = min{x(F) — 1 : F G F}. This determines the Turan number 
asymptotically when F consists of non-bipartite graphs. However, much less is known concerning 
C^ the Turan numbers of bipartite graphs. There is no bipartite graph F containing a cycle such that 

ex(n, F) is known exactly for all n. In fact, even the order of magnitude of ex(n, F) is not known for 
quite simple bipartite graphs, such as the complete bipartite graph with four vertices in each part, 
the cycle of length eight, and the three-dimensional cube graph. 

In this paper, we study the effect of forbidding short odd cycles on bipartite extremal problems, in 
particular, the extremal problem for even cycles. Let Ck denote a cycle of length k and let Ck denote 
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the set of cycles Ci where 2> < i < k and t and k have the same parity. Bondy and Simonovits [8] 
showed that ex(n, C2£) = 0(n-^"'"^/^), which was further improved by Lam and Verstraete [26j to 

ex(n,C2^)< V+i/'^ + 0(n). (1) 

It is a notoriously difficult problem to determine even the order of magnitude of ex(n, C2i) or z(n, C2i) 
for £ {2,3,5}. For £ G {2,3,5}, the existence of structures from projective geometry called 
generalized polygons show ex(n,C2^) = 0(n^"^^' ). Erdos and Simonovits |13j conjectured more 
generally that ex(n,C2^) = 0(n^+^' ) for all I > 2, and this conjecture remains open. For large i, 
the densest known C2£-free graphs on n vertices are the recent constructions of Ramanujan graphs 
based on octonions, due to Dahan and Tillich f9], superseding earlier constructions of Margulis [31], 
Lubotzky, Phillips, and Sarnak J29J, and Lazebnik, Ustimenko and Woldar |27j. The constructions 
have n^"*"^ edges where ^ ~ ^ as £ — )• cxd. For £ = 4, the current best bounds are cin^'^ < ex(n, €21) < 
C2n^'^ for some constants ci,C2 > ~ the lower bound is from the constructions of Benson [6j and 
Singleton ^6], whereas the upper bound follows from (jTl) with i = 4. In this paper, we study the 
relationship between extremal €21 U {Cfc}-free graphs with k odd, and extremal C2£-free bipartite 
graphs. 

1.1 Main Results 

Define the Zarankiewicz number z(n, T) to be the maximum number of edges in an J^-free bipartite 
graph on n vertices. Erdos and Simonovits [131 Conjecture 3] conjectured that for any family J^ 
consisting of bipartite graphs there exists an odd integer k such that ex(?i, J^U Ck) ~ z{n,T). They 
proved this when J^ = {C4} by showing that ex(n, {C4, C5}) ~ z(n, {C4}). In this paper, we extend 
this by proving the following theorem which verifies the Erdos-Simonovits conjecture when J^ = €21 
for £e {2,3,5}. 

Theorem 1.1 Suppose i E {2, 3, 5} and k > 2i is odd. Let G be a C2^U{Cfc}-/ree graph on n vertices 
with average degree d = Q{n^i^). Then G has a bipartite subgraph H with at least d^~^^ — o{n^~^^'^) 
edges. If also d > (1 + o(l))(n/2)i/^ then d ~ {n/2fl'^ and e{H) ~ e{G) ~ (n/2)i+i/^ In particular 

ex(n,C2£U {Ck}) ~ z{n,C2(). 

Note that the statement in Theorem |1.1| is stronger than that of the conjecture in two ways. 
Firstly, we replace Ck by C^, i.e. we forbid a single odd cycle rather than all short odd cycles. 
Secondly, we obtain a stability theorem: any graph which is close to extremal for C21!, U {C^} must 
be close to bipartite. This is the first ingredient in applying the stability method, in which one 
first obtains approximate structure, and then eliminates any imperfections to obtain exact structure. 
Using this theorem we can prove the following exact result. The reader unfamiliar with generalized 



+ l)-gons can find a brief description in Section 3.7 these can be viewed for infinitely many prime 



powers g as (g + l)-regular C2£-free extremal bipartite graphs with q^ + q^ ^ + • • • + 1 vertices in each 
part. 



Theorem 1.2 Let i G {2,3,5}, k > 2£ + 2 be odd, n > Uk be sufficiently large, and define q G M+ 
by n = 2{q^ + q^~^ + ••• + !). Then any C2e-free graph G on n vertices with at least ^{q + l)n 
edges contains a cycle of length k, unless q is an integer and G is the bipartite incidence graph of a 
generalized (i+ l)-gon of order q. Furthermore, if q is an integer, n > n^. and there is a generalized 
(i + l)-gon of order q, then 

ex(n,C2£U {Cfc}) = z{n,C2e)- 

The same statement holds when i = 2 and k = 5. 

1.2 Chromatic number 

A classical result of Andrasfai, Erdos and Sos [5j states that a triangle-free n-vertex graph with 
minimum degree more than 2n/5 is 2-colorable. Generalizations of this theorem have been studied 
extensively by researchers, for example see [H [H \2T\ [30] and their references. Here we address a 
similar question for a C2£ U {Cfc}-free graph when A: > 4^ + 1 is odd. We use x{G) to denote the 
chromatic number of G. 

Theorem 1.3 Let i > 2 be an integer, and let k > M +1 be an odd integer, let c be a positive real 
number, and let G be a C2^U{Cfc}-/ree graph on n vertices with minimum degree at least en}' . Then 
xiG) < {AkY+^/c^. 

Forbidding short odd cycles in a graph generally has little effect on the chromatic number if the 
graph is too sparse. A well-known construction of Erdos in random graphs shows that there are 



graphs of arbitrarily large girth and chromatic number. Theorem 1.3 in contrast shows that the 



chromatic number becomes bounded for very dense C2^-free graphs with a forbidden long odd cycle. 
1.3 Short odd cycles 



We start by observing that the second statement of Theorem 1.2 does not hold whenever 5 < k < 2i 
and k is odd, as in this case we have 

ex(n, 621 U {Ck}) > z(n, C2^) + 1. 

To see this, consider an extremal C2^-free bipartite graph H on n vertices. Let G be obtained from 
H by adding an edge joining a pair of vertices {x, y} at distance two in one part of H. We claim 
that G has no C^ for odd k with 5 < k < 2£. For such a cycle would have to contain the edge {x, y}, 
so we would have a path P of length k — 1 in H from x to y. Adding the edges xz and yz, where z 
is a common neighbor of x and y, we obtain a closed walk of length k + 1. Furthermore, this walk is 
not acyclic, so it must contain an even cycle of length at most 2£ in H, contradicting the fact that 



H is C2£-free. It would be interesting to see if Theorem 1.2 can be extended to the last remaining 
case, namely k = 2i + 1. 

The following proposition determines an upper bound for z(n,C2^). Its proof will follow easily 



from the counting arguments in Section 3.7 



Proposition 1.4 Suppose n G N, and let q G M+ be defined by n = 2{q^ + q^ -"^ + • • • + 1). Then 

z{n,C2i) < \{q + l)n. 

Now we can describe a much stronger discrepancy between Turan and Zarankiewicz numbers when 
one forbids a short odd cycle. Consider the polarity graphs constructed by Lazebnik, Ustimenko and 
Woldar [28|. These are C2£-free graphs on n vertices with (1/2 + o(l))n^~^^' edges, such that for 
i = 3 they have no triangles and no even cycles of length at most six, and for i = 5 they have no 
triangles and no cycles of length five and no even cycles of length at most ten. Thus when (i, k) is 
(3,3), (5,3) or (5,5) we have 

,.^ .^j ex(«,c.u{c.}) ^ ,__^ ^^^ ("';;;')/2 _ ,!„_ 

n^oo Z[n,C2i) n-5-oo (n/2)^+^/'- 

so in these cases one does not even have an asymptotic result similar to Theorem |1.1[ For the case 
i = 2 and A; = 3 we do not know of such a strong discrepancy. Parsons [33J constructed {C3, C4}-free 

graphs showing that 

7 _ 

ex(n, {C3, C4}) > z(n, C4) + ^^ - 0{Vn) 

when n = (^) and q = I mod 4 is prime. On the other hand, Erdos [TOl [11] suggested that in 
the case of {C3,C4}-free graphs there should not be a stronger discrepancy, and conjectured that 
ex(n, {C3, C4}) ~ z(n, C4) - this conjecture remains open. One may also ask in general whether or 
not ex{n,C2e U {C2e~i}) ~ z{n,C2e) for i > 2. 

1.4 Organization 

This paper is organized as follows. In the next section we illustrate our ideas by sketching the proofs 
of Theorems |1.1| and 1.2 in the case of quadrilateral- free graphs. Section 3 contains the essential 



results on counting walks and paths in graphs. These are used throughout the paper, in particular 
to deduce some bounds on Turan numbers and Zarankiewicz numbers for cycles in the same section. 
In Section 4 we show that nearly extremal graphs contain large subgraphs which are almost regular. 
We prove a pseudorandomness property for nearly extremal graphs in Section 5. Section 6 contains 



the proof of Theorem 1.1 and Section 7 the proof of Theorem 1.2 Section 8 contains the short proof 



of Theorem 1.3 We make some concluding remarks in the final section. 



1.5 Notation 

We write e{G) for the number of edges in a graph G. In a graph G, let Nr{v) denote the set of 
vertices at distance exactly r from v, and let dr{v) = \Nr{v)\. For r = 1, we omit the subscript r, so 
that d(v) is the degree of v and N(v) is the neighborhood of v. We write dsiv) for the number of 
neighbors of a vertex t; in a set B. We write G[S] for the subgraph of G induced by a set S Q V{G) 
and e{S) for the number of edges in ^[5"]. Given two sets 5, T C V{G), not necessarily disjoint, 
we write e{S,T) for the number of ordered pairs (s,t) with s €z S, t £ T and st S E{G). For 
example e{S,S) = 2e{S) = 2\E{S)\. In addition to the Turan number ex{n,J^) and Zarankiewicz 
number z{n,J^) defined above, we use z(o, 6, J-") for the maximum number of edges in an J^-free 



bipartite graph that has a vertices in one part and b vertices in the other part. We let M^ denote 
the positive reals and N the positive integers. Our asymptotic notation assumes that n — )• oo; we 
write /(n) ~ g{n) if lim^^oo f{n)/g{n) = 1, and /(n) = o{g{n)) if lim^^oo f{n)/g{n) = 0. 

2 Sketch proof for 4-cycles 



In this section we outline the proofs of Theorems 1.1 and 1.2 in the case (. = 2, i.e. 4-cycles. This 



introduces the main ideas of our approach, without some technicalities that arise in the other cases. 
2.1 Stability 



We start with Theorem 1.1 for i = 2. The idea is that we can take H to be the bipartite subgraph 
of G, containing all the edges between N2{v) and N3{v) for a suitable vertex v. Suppose that G is 
a graph on n vertices with average degree d and G does not contain C4 or Ck for some odd k > 5. 
The proof proceeds by the following steps. 

1. Controlling the maximum degree. Lemma |4.1[ i) will show that we can delete at most 2n 
edges from G to obtain a subgraph G' with maximum degree A < 2y/n. 



2. Enumeration of w^alks and paths. The Blakley-Roy inequality (Proposition 3.1 ) gives a lower 
bound on the number of walks. It implies that we can find a vertex v that is the start of at least d'^ 
walks of length 3, where d' > d — 4 is the average degree of G' . Also, the bound on the maximum 
degree implies that all but 0{n) of these walks are paths, i.e. we have d^ — 0{n) paths of length 3. 



3. Finding odd cycles. We shall show in Section 3.6 that if G[N2{v)] has average degree at 
least max(6, 2k — 8), then G contains Ck, so we conclude that G[N2{v)] has average degree less than 
max(6, 2A; — 8). This implies that the number of paths of length 3 which start at v and end in N2{v) 
is 0{n). Since G is C4-free, we also have no path of length 3 starting in v and ending in Ni{v). 
Therefore, all but 0{n) of the paths found in step 2 go from v to N^^v). 

4. Conclusion. Since G is C4-free, each edge between N2{v) and N^iv) is contained in at most 
one path of length 3 from v to N^{v). Thus we obtain d^ — 0{n) edges between N2{v) and N^{v), 
and these constitute the bipartite subgraph H needed for the first statement in Theorem |1.1[ For 



the second statement, note that Proposition 1.4 implies z{n,G/{) < (1 + o(l))(n/2)^". Since H \s a. 



bipartite C4-free graph we must have d < (1 + o(l))(n/2)"^'^. If also d > (1 + o(l))(n/2)"^'^ then we 



have d ~ (n/2)^'^ and e{H) ~ e(G) ~ (n/2)^'^, which proves Theorem 1.1 for i- = 2, apart from the 
case k = 5 which is proved by slightly refining the above arguments. 

2.2 Exact result 



Now we sketch Theorem 1.2 for £ = 2. Suppose that n is large, G is a graph on n vertices with 
e(G) > (g + l)n/2, where q G M"*" is defined by n = 2{q^ + g + 1), and G does not contain C4 or Gk 
for some odd k > 7. The proof proceeds by the following steps. 

1. Pseudorandomness. Not much is known about the structure of nearly extremal graphs for 
bipartite Turan problems. Here we obtain a result in this direction. It says that the number of edges 
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Figure 1: Constructing a cycle of any specified odd length k >7 



between any two large sets in nearly extremal graph is close to what one would expect in a random 
graph with the same edge density: more precisely we shall show that if G is a C4-free bipartite graph 
on n vertices with parts X and Y and average degree d ~ {n/2)^''^, then for any 5 C X and T C y 
^JSIITI + o(n'^'^). This result is a special case of Theorem 



5.1 



we have e{S,T) 

2. Controlling the minimum degree. We reduce the proof to the case when the minimum degree 
satisfies S{G) > q/A. This uses a vertex deletion argument that is quite standard in extremal graph 
theory. We consider a sequence of graphs G = Gn, G„_i, • • • , Gt for some < t < n, where Gj_i 
is obtained from Gi by deleting a vertex of degree at most g/4, while possible. Some calculations 



show that this process must terminate with t > n/2. Now suppose that we know Theorem 1.2 holds 
under the additional assumption 5{G) > q/A. Applying this to Gt gives e{Gt) < {r + l)t/2, where r 
is defined by t = 2(r^ + r + 1). Furthermore, we have e{G) < e{Gt) + (n — t)q/A, and calculations 



Since e{G) > {q + l)n/2, we must have t 



n, 



show that this is less than {q + l)n/2, unless t = n. 

so S{G) > q/4:, and we are justified in assuming this when proving Theorem 1.2 

3. Refining the approximate structure. Now we have e(G) > {q + l)n/2 and can assume that 
6{G) > q/4. Let H he a bipartite subgraph of G with maximum size. Theorem |1.1| implies that 
e{H) ~ e{G) ~ {n/2)^'^. Furthermore, maximality of H implies that S{H) > q/8. Actually, we only 
need the fact that S{H) > cn^''^, where c > is independent of n. Now we will show that G = H. 
Label the parts oi H a.s A and B. Suppose for a contradiction that G[A\ contains an edge ac. Let 
6 be a neighbor of c in B. Now we 'explore' the graph until we reach two sets of linear size where 
we can apply pseudorandomness (see Figure 1). By the minimum degree assumption, we can take 
l^fel) \Ba\ ~ cn^/^ such that all vertices in Ai, are neighbors of h and all vertices in Ba are neighbors 
of a. Next, taking all the neighbors of vertices in Ai, we get a set Bi, with l-B^I = Q{n) and by taking 
neighbors of Ba we get a set Aa with |j4a| = Q{n). Then step 1 (Pseudorandomness) gives 

e{Aa,B^) = ^\Aa\m+o{n^/^) = e{n'/^). 
n 

In particular, we can find a path of length /c — 6 using only edges between Aa and i?ft. By construction 



this can be completed to a cycle of length k in G, which is a contradiction. We deduce that A is an 
independent set in G. Similarly B is independent, so G = H is bipartite. The characterization of 



equality now follows from a result on Zarankiewicz numbers for even cycles - see Proposition 3.9 



so this proves Theorem 1.2 for quadrilaterals. 



3 Counting walks 

The basis of estimates for both Turan numbers and Zarankiewicz numbers for even cycles is counting 
various types of walks in graphs. These counts are also used in Step 2 of the stability result (Enu- 
meration of walks and paths) and Step 1 of the exact result (Pseudorandomness) . A walk of length t 
in a graph G is a sequence focof iCi . . . vt-ict-ivt such that Vi G V{G), Cj G E{G) and Cj = {vi,Vi+i} 
for < i < t. Note that edges may be repeated in a walk, even on consecutive steps. 

3.1 The Blakley-Roy Inequality 

Let Wk (G) denote the number of walks of length A; in a graph G divided by the number of vertices in 
G - this is the average number of walks of length k starting at a vertex. If G is an d-regular graph 
on n vertices, then clearly Wk{G) = d^ . Blakley and Roy [7j proved a matrix version of Holder's 
Inequality, which shows (as a special case) that any graph of average degree d has at least as many 
walks of a given length as an d-regular graph on the same number of vertices: 

Proposition 3.1 Suppose G is a graph of average degree d. Then Wk{G) > d . 

There are many proofs of this inequality in the literature; the original proof of Blakley and Roy 
uses eigenvalues. We now briefly discuss the tight connection between walks and eigenvalues. 

3.2 Walks and eigenvalues 

Let G be a graph on n vertices with adjacency matrix A, and suppose A has orthonormal eigenvectors 
xi,X2, ■ ■ ■ ,Xn and eigenvalues Ai > A2 > • • • > A,i. Then we may write 

Wk{G) = e^A''e where e = n'^/^{l,l, . . . ,1). 



In this notation, the Blakley-Roy inequality (Proposition 3.1) mentioned earlier can be stated as 
Wk > Wi- For future reference we note the following proposition, attributed to Chris Godsil in |13j . 

l/r 1/s 

Proposition 3.2 Suppose r, s G N, where r is even and r > s. Then Wr > w. 



s 



Proof. Write e = ^CjXj. Then ^ c^ = e • e = 1 and Wr = e^A^e = X^c?A[. Jensen's inequality 
applied to fit) = rl' gives Wr = Yl 4K = E c?|A|r/^ > (E cf] Af |)"/' > vfj'. U 



3.3 Closed walks and trace 

We can also use spectral theory to count closed walks. Let w"^ be the number of closed walks of 
length A; in a graph G divided by the number of vertices in G - this is the average number of closed 
walks of length k starting at a vertex. 

Proposition 3.3 u;° := ^Tr{A'') = \ Y2=x ^i ■ 

This gives rise to a standard method for establishing a spectral gap in a d-regular graph: if w'^ 
is close to Xi = d^ , then all |Ai| for z > 1 must be small. We have the following bound on ^2^ , 2 in 
dense C2£-fi'ee bipartite graphs with roughly equal part sizes. 

Lemma 3.4 Suppose G is a bipartite graph on n vertices with part sizes n/2 + o{n), maximum degree 
A and girth at least 2£ + 2. Then 

wlt+2{G) < (1/2 + o(l))nA2 + {4AY+\ 

Proof. Consider any vertex v and a closed walk W of length 2-^ + 2 from v. If W^ is not a cycle then 
the girth assumption implies that its underlying graph is acyclic. The number of such walks W is 
therefore at most the number of closed walks of length 2^ + 2 from the root in the complete A-ary 
tree. A crude upper bound on this number is ( ^^-|^ ) A^^-*^ < (4A)^"^^, as may be seen by choosing 
the i + 1 times when the walk moves towards the root and multiplying A choices for the i + 1 times 
when the walk moves away from the root. (The exact formula is £j:2{e+i)^ ^^ ^^^ ^^ ^° ^^^ need 
this.) In the case when VK is a cycle, we estimate the possibilities by considering the neighbors a 
and 5 of f on W, and the opposite vertex u at distance i + 1 from v on W. Note that W is uniquely 
determined by a, b and u, as the girth assumption implies that there is at most one path of length i 
between any specified pair of vertices. We can choose each of a and b in at most A ways, and u in at 
most n/2 + o{n) ways (using the assumption on the part sizes). Thus the number of possibilities is 
at most (1/2 + o(l))nA^, which also takes into account the orientation of the cycle. Combining the 
two cases, the number of closed walks of length 2i + 2 from v is at most (1/2 + o(l))nA^ + (4A)^"'"^. 
Since v was arbitrary, this bound also holds for the average W2£,2(G)- '— ' 

3.4 Non-returning walks 

A non-returning walk of length fc in a graph G is a walk woco^iei . . .Vk-iCk-iVk of length k such 
that Cj 7^ ej+i for < i < k. Let VkiG) denote the number of non-returning walks of length k in 
a graph G divided by the number of vertices in G. If G is an r-regular graph on n vertices then 
clearly Vk{G) ^ ^(^ — 1) . Alon, Hoory and Linial [2j gave an analogue of Proposition 
non-returning walks. 



3.1 



for 



Proposition 3.5 Suppose G is a graph of average degree r > 2. Then Vk{G) > r{r — 1) ^ . Ifr G N 
then equality holds if and only if G is r-regular. 



In fact, they proved a slightly stronger form that the result given, which is sensitive to variations 
in the degrees of the vertices in the graph. Sidorenko [35] gave a bipartite analogue of Proposition 



3.1 Here we need a bipartite analogue of Proposition 3.5, which was proved by Hoory |23j . We will 



only need the result for walks of odd length. 

Proposition 3.6 Suppose G is a bipartite graph with parts A and B and average degree d. Let a 
he the average degree of vertices in A and j3 the average degree of vertices in B . Then for any t G N 
we have 

i^2t+i>d n {d{v)-l)"'^''^''^''^>d{a-l)\p-lf. 

V^AVJB 

If a, /3 £ N then equality holds if and only if every vertex of A has degree a and every vertex of B 
has degree f3. 

3.5 Paths 

A path is a walk which has no repeated vertices or edges. Let Pi{G) denote the number of paths of 
length i in a graph G divided by the number of vertices in G. 

Lemma 3.7 Suppose G is a graph with maximum degree A and average degree d, and £ G N. Then 

vt{G)>s-e^'-\ 

Proof. By the Blakley-Roy inequality there are at least nd^ walks of length £ in G if G has n 
vertices. Let (ui, U2, . . . , f^) be a walk of length t which is not a path. Then Vi = Vj for some distinct 
^)i £ {1;2, ...,£}. Fixing i < j, there are at most nA^~^ choices for the part of the walk up to 
Vj-i, then the next step is determined since Vj = Vi, and then there are at most A^~^ choices for 
the remaining steps. There are at most l'^ choices for i and j, and therefore the number of walks of 
length i which are not paths is at most nl'^A^^^. Dividing by n, we obtain the lemma. D 

3.6 Finding odd cycles 

We use the following lemma which is implicit in |40j . 



Lemma 3.8 Suppose G is a graph, v is a vertex of G and G[Nr{v)] has average degree at least 
2s — 4, for some r > 1 and odd s > 5. Then G contains cycles G2m+i, C'2m+3, • • • , G2m+s for some 
1 < m < r. In particular, if G is a graph not containing a cycle Ck, for some fixed odd k > 2r, then 
G[Nr{v)] has average degree less than max(6,2A; — 8). 

Proof. To give a self-contained proof of this lemma would require the duplication of large parts 
of [IQ], so instead we just sketch the argument, and refer the reader to [IQ] for the omitted details. 
Following the proof of [401 Lemma 3], we note that G[Nr{v)] has a subgraph with minimum degree 
at least s — 1, and so contains a subgraph H which is a cycle of length at least s with at least one 
chord. Next, as in the proof of [401 Theorem 1], we consider a minimal subtree T of the breadth 



first search tree rooted at v that contains V{H). By minimality T branches at its root. We let A 
be the set of vertices of H belonging to one branch of T and let B = V{H) \ A. Next we apply [401 
Lemma 2], which tells us that H contains paths from A to i? of every length i < \V{H)\, unless 
H is bipartite with bipartition {A,B). Even if H is bipartite with bipartition {A,B) we still have 
paths from A to -B of every odd length I < \V{H)\. These may be completed to cycles via paths in 
T, which all have the same length 2m, where m is the distance from the root of T to Nr{v). The 
second statement of the lemma is proved by taking s = max(5, k — 2). D 

3.7 Zarankiewicz numbers of even cycles 

In this section, we describe extremal bipartite C2£-free graphs and generalized polygons, and give a 
proof of extremality by using the walk counting arguments covered in the previous sections. In the 
cases £ G {2,3,5}, the bound nlh for ex(n,C2£) is asymptotically tight, i.e. ex(n,C2£) ~ ^n^^^'^. This 
is shown by constructions from projective geometry, due to Erdos and Renyi [12] for i = 2, and to 
Benson [6j and Singleton [36] for ^ = 3 and £ = 5, based on generalized polygons. We briefly describe 
these here, referring the reader to [28] for more details. Suppose P is a set of points, L a set of lines, 
and / an incidence relation between P and L. The bipartite incidence graph is the bipartite graph 
with parts P and L such that p G P is adjacent to £ £ L if and only if (p, £) G I. Let q > 2 and £ >2 
be integers. A generalized {£ + l)-gon of order q consists of a set of q^ + q^~^ + ••• + ! points P and a 
set of q^ + q^~^ + ••• + ! lines L with an incidence relation / such that the bipartite incidence graph 
of (P, L, /) is {q + l)-regular and has girth 2^ + 2 and diameter £ + 1. Feit and Higman |16j showed 
that generalized {£ + l)-gons of order q only exist for £ = 2,3, 5. These are known as generalized 
triangles, generalized quadrangles and generalized hexagons respectively. Generalised triangles are 
precisely projective planes, whereas the generalized quadrangles and generalized hexagons were first 
constructed by Tits in his seminal paper [39]. 

We start with the proof of an upper bound for z(n, C2^), and the characterization of equality by 
generalized polygons described above. For £ = 2 this result is well-known (see |3], pp. 273-274 for a 
nice exposition). Here it is convenient to just give the proof for odd £ > 2, as we do not need the 
other cases. 



Proposition 3.9 Let £ = 2 or £ > 2 be odd. Suppose n G N, and let q G M+ be defined by 

n = 2[q^ + g^-i H + 1). Then z{n,C2e) < Ul + 1)"' "^^^^ 

integer and there exists a generalized {£ + l)-gon on n points. 



n = 2{q^ + q^ ^ + • • • + 1). Then z{n,C2e) < ^{q + l)n, with equality if and only if q is a positive 



Proof. Let £ > 2 be odd and G be a bipartite graph on n vertices with e(G) > (g-|- l)n/2 containing 
no even cycle of length at most 2£. Then G has average degree at least q + 1. Let A and B be the 
parts of G. Then |^| + \B\ = n. By the girth assumption, for any ordered pair of vertices x, y there 
is at most one non-returning walk of length at most £ from x to y. Since £ is odd, we need only 
consider pairs with x and y in different parts, of which there are 2|^||P| < n^/2. By Proposition 



3.5, the number of non-returning walks of odd length at most £ is at least 



(^-i)/2 e 

n{q + l) ^ q^^ = n^q^=ny2. 

i=0 j=0 
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We conclude that the number of non-returning walks of length at most l is exactly 2|^||i3| = n /2 



so \A\ = \B\ = n/2 and G has diameter i + 1. Since equality only holds in Proposition 3.5 for 
regular graphs, every vertex of G has degree q + 1. Therefore G is the bipartite incidence graph of 
a generalized {(. + l)-gon. D 

We also require the following bound on unbalanced Zarankiewicz numbers z(a,b,C2i)- 

Proposition 3.10 Suppose i = 2 or £ > 3 is odd, and a,b > 1. Then 

z{a,b,C2e) < (06)2 + 21 -|-max{a,6}. 

Proof. The case i = 2 follows from a slightly stronger result of Reiman [M] . Now suppose i > 3 
is odd. Let G be a C2^-free bipartite graph with parts A and B of sizes a and b, and let a and /3 



denote the average degrees of vertices in A and B. As in Proposition 3.9, there are at most 2ab 
non-returning walks of odd length at most i. Using the lower bound from Proposition 3.6 we deduce 
that 

(^-l)/2 

2e(G) ^ {a- l)^(/3 - 1)^ < 2ab. (2) 

i=0 

Suppose for a contradiction that e{G) > {aby + max{a,6}, where c = 5 + jj- Then a — 1 = 
(e(G) - a)/a > a^-^b^ and /3 - 1 = (e(G) - b)/b > a%^-\ so (q - l)(/3 - 1) > {abf^-^ = (a6)V^. 
However, this gives 

e{G){a - lf-'y\/3 - if-'^^ > {abr{abf-'y^' = ab, 

which contradicts pi). This gives the required bound on e(G). D 

4 Degrees in nearly extremal graphs 

In this section, we show that in nearly extremal C2£-free bipartite graphs for i £ {2, 3, 5}, the number 
of edges containing a vertex of degree substantially more than the average degree is small. This is 
used in Step 1 of the stability result (Controlling the maximum degree) and Step 1 of the exact result 
(Pseudorandomness). We also show how to classify extremal graphs once extremal graphs of large 
minimum degree are classified. This is used in Step 2 of the exact result (Controlling the minimum 
degree) . 

4.1 Bounding the maximum degree of C4-free graphs 

First we show that any C4-free bipartite graph on n vertices does not contain many edges on vertices 
of degree much more than roughly (n/2)^'^, which is the average degree in extremal C4-free bipartite 



graphs by Proposition 3.9 Recall that we write dsiv) for the number of neighbors of a vertex v in 



a set B and e{S, B) for the number of ordered pairs (s, b) with s € S, b £ B and ,sb € E{G). 
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Lemma 4.1 Suppose G is a C^-free graph on n vertices and A, B C V{G). Suppose < e < v3 
and let 

S = {veA: dB{v) > (1 + e)\B\^''^}. 

Then e{S,B) < 2\B\/e. In particular, we have the following. ■ 

(i) If G is a Ci-free graph then at most 2n/e edges contain vertices of degree at least (1 + e)y/n. 
(a) If G is a Ci-free bipartite graph with parts X and Y , then e(T, X [JY) < 2n/e, where 
T = {veX : dyiv) > (1 + e)\Y\^/^} U {v e Y : dx{v) > (1 + e)\X\^/^}. 

Proof. Since G is C4-free, every choice of a vertex v in B and two neighbors s, s' of t; in 5 gives rise 
to a different pair {s, s'} in S, so 

v&B \ / V 

Since the function /(x) = x{x — 1) is convex for x > 1, Jensen's inequahty gives 

\S\{\S\-l)>e{S,B){e{S,B)/\B\-l). 

Suppose for a contradiction that e{S,B) > 2\B\/e. Then 

e{S,B)-\B\ > {l-e/2)e{S,B). 

Since e{S,B) > \S\ • (1 + s)\B\^/^ by definition of S, we have 

\B\\S\^ > e(5, B)ieiS, B) -\B\)>{1- e/2)e{S, Bf > (1 - e/2)(l + Ef\B\\S\\ 

However, (1 — e/2)(l + e)"^ = 1 + (3 — e^)e/2 > 1, contradiction. So we must instead have e(S', B) < 
2\B\/e. Now statement (i) follows by taking A = B = V{G), and statement (ii) follows by first 
taking A = X and B = Y and then taking A = Y and B = X. D 

4.2 Bounding the maximum degree in C2f-free graphs 



Next we prove an analogue of Lemma |4.1| for C2^-free bipartite graphs. The accurate estimate given 
here is only needed for the pseudorandomness argument; a cruder ad hoc method will suffice for 
bounding the maximum degree in Theorem |1.1[ In the following lemma we could give a quantitative 
description of how the o(-) estimate depends on d; however the calculations are somewhat heavy, so 
we rather retain only the asymptotic statement as n — )• oo. 

Lemma 4.2 Let e > and suppose G' is a C2e.-free bipartite graph on n vertices with average degree 
d ~ (n/2)-^'^, where £ is odd. Then at most o{e{G')) edges contain a vertex of degree at least {l + e)d. 

Proof. We start by repeatedly removing vertices of degree or 1 to leave a graph G with minimum 



degree at least 2. This process removes at most n = o{e{G')) edges, so Proposition 3.9 implies that 



we remove o{n) vertices. Let A and B be the parts of G. As in the proof of Propositions 3.9 and 
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3.10, the girth assumption imphes that the number of non-returning walks of odd length at most (. 



is at most 2|j4||i?|. Here we will just use the estimate 

Vi{G)<2\A\\B\/n<n/2. (3) 



In the lower bound for u^ we will use the full strength of Proposition 3.6 to get an improvement if 



there are many edges i nciden t to vertices of large degree. First we show that the bipartition is roughly 



balanced. Proposition |3.10| gives e{G) < {\A\\B\)2 + 2i +n. Since e{G) ~ e(G') = nd/2 ~ (n/2)i+i/^ 
we deduce that |^| ~ \B\ ~ n/2. 

Now let S be the set of vertices in A of degree at least (1 + e)d. We will show that e[S^ B) = 



o{e{G)). Proposition 3.6 gives 

n{G) > dTT{Af-^^l\{Bf-''^'\ (4) 

where for any C C V{G) we write 

For vr(i?) we just use the simple bound 

vr(S) >e[G)/\B\ - 1 ~ d. 

To see this we apply Jensen's inequality with the convex function f{x) = j;log(x — 1) for x > 2, 
using the fact that G has minimum degree at least 2. This shows that 'k{B) is minimized when 
d{v) = e{G)/\B\ for all v & B. Since e{G) ~ (n/2)^+-'^/^ and \B\ ~ n/2 we get the stated bound on 
vr(S). 

For vr(^) we estimate it{S) and 7r{A \ S) separately. We write 

\S\=an/2 and e{S,B) = pe{G). 
The parameters p and a satisfy 

(1 + o(l))(l + e)a<p<{l + o{l))a'^+^i, (5) 



wher e the lower bound follows from e{S,B) > |5|(1 + e)d, and the upper bound from Proposition 



3.10 



which gives e{S,B) < {\S\\B\)2~^2i + n. By Jensen's inequality, vr(S') is minimized when 
d{v) = pe{G)/\S\ for all v e S, and tt^A \ S) is minimized when d{v) = (1 - p)e{G)/\A \ S\ for all 
V G A\ S. Therefore 



i-P / ox « /I _ ^\ i-P 



Applying pi) and Q we deduce that 

n/2 > ui{G) > {1 + o{l))d { d^ ' ^^ ' ^ 



-a J \1 — a 
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Since d ~ {n/2)^/^ we obtain 



3l^)""'--«^ 



Now recall that our goal is to show that e{S, B) = o(e(G)), i.e. p = o(l). Suppose for a contradiction 
that we can choose an infinite sequence of graphs Gn as above with analogous parameters pn and cr„ 
such that Pn ^ c for some constant c > 0. Then we can pass to a subsequence such that C7„ — )• s and 
Pn ^- r, for some r,s G [0,1] with r > c. Then r and s satisfy 



r\r n "^1-^ 



/(^«)^= T h^- <1. (6) 



1^ 1 



We also have (l + e)s < r < 52 + 2* by ([5J). Note that this implies r < 1. Consider the function 
g{t) = f{r,t). Computation of derivatives gives 



t-r , , , ,,, , 2(t-r)2+r(l - r) 
-g(t) and g" t) = ^ /, ^^5 ^ 



Thus g'{r) = and g"{t) > for all t, so (7(i) is convex, is decreasing for t < r and is minimized at 
t = r. Since s < r/(l + e) the minimum value possible for /(r, s) is at s = r/(l + e). Substituting 
in ^ and simplifying gives 

(\ 1— r 
1 + T^ 
1 — r 

However this is a contradiction, by the standard inequality (1 + y/xY < 1 + y for x, y G (0, 1). We 
deduce that e(S', B) = o{e{G)). The same argument shows that o{e{G)) edges contain a vertex in B 
of degree at least (1 + e)d, so the proof is complete. D 

4.3 Bounding the minimum degree 



Here we implement Step 2 in the proof of the exact result, by reducing the proof of Theorem 1.2 



to the case when the minimum degree satisfies (5(G) > g/4. Recall that G is a C2^-free graph on 
n vertices with at least ^{q + l)n edges, where n is large, £ G {2,3,5} and q G M"*" is defined by 
n = 2{q^ + q^~^ + • • • + 1). We use the vertex deletion argument as in Section [2J Consider a sequence 
of graphs G = Gn, Gn-i, • • • , Gt for some < i < n, where Gi_i is obtained from Gj by deleting a 
vertex of degree at most g/4, while possible. Note that, by definition, the minimum degree of Gj is 
at least q/A. We claim that t > n/2. For suppose otherwise and consider Gn/2- We have e(G„/2) < 
^(n/2)^"'"^'^+0(n) by (ll|) (see Introduction). Also, the number of edges deleted is at most (n/2)(g/4). 
Since we assume that e(G) > {q + l)n/2 we get e(G„/2) > ^qn/^. But qn/2 ~ (n/2)^+'^/^, so this 
contradicts the upper bound. Thus we do have t > n/2. Next we claim that e{Gt) > (r + l)t/2, where 
r is defined by t = 2 ^j^qT*. To see this note that e(Gj) > e(G) — (n — f)g/4 > {q+l)n/2 — {n — t)q/4:. 
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Then, using q > r, we calculate 



{q+l)n/2-ir + l)t/2-{n-t)q/4 = (g + 1) ^g^ - (r + 1) ^r^ - (g/2) ^ <7* - ^rM 

i=0 i=0 \i=0 4=0 / 

i=0 i=0 

> (g/2 + l)(^g^-^rM > 0. 
\i=o i=o / 

We deduce that e{Gt) > {r + l)t/2, with strict inequality unless t = n. Now suppose that we 



know Theorem 1.2 holds under the additional assumption 5{G) > q/A. Applying this to Gt gives 



e(Gi) < (r + l)t/2. Thus we must have t = n, so 6{G) > q/A, and we are justified in assuming this 



when proving Theorem 1.2 



5 Pseudorandomness 



A key ingredient in the proofs of Theorem 1 1 . 1 1 and 1.2 is the notion of pseudorandomness. There are 



many equivalent notions of pseudorandomness in graphs: we refer the reader to [25j for a survey. 
In this section we will present spectral properties of graphs that imply pseudorandomness. We will 
prove the following result, which expresses the pseudorandomness property of a C2£-free bipartite 
graph of close to maximum size: for any two large sets the number of edges between them is roughly 
the same as in a random bipartite graph of the same density. 

Theorem 5.1 Suppose G is a €21- free bipartite graph on n vertices with parts X and Y and average 
degree d ~ (n/2)i/^ Then for any S O X andTOY we have e{S,T) = ^\S\\T\ + o(n^+^/^). 



This theorem for ^ = 2 is the pseudorandomness part of the sketch proof given in Section 2.2 



5.1 Pseudorandomness of regular graphs 

The exposition in this subsection repeats that in \25\ Section 2.4], so we will be brief and refer the 
reader to that survey for more details. As a warmup, we give an exposition of the fact, first proved 
by N. Alon, that a regular graph with a large spectral gap is pseudorandom (this is sometimes known 
as the 'expander mixing lemma'). Then we establish analogous results in the bipartite setting, which 
seem not have been explicitly presented in the previous literature. 

Suppose G is a graph on n vertices and let A = {auv)u,veV{G) be its adjacency matrix, i.e. auv is 
1 if uv is an edge or otherwise. (We fix some ordering vi,V2, ■ ■ ■ ,Vn of the vertices and identify Vi 
with i.) Then A is a real symmetric matrix, so has an orthonormal basis xi, X2, . . . , Xn of eigenvectors 
with real eigenvalues, which we order so that Ai > A2 > • • • > A„. Note that Y17=i ^i ~ Tr(^) = 0, 
so Ai > and A„ < 0. The Perron- Frobenius theorem implies that |Aj| < Ai for all i and all entries 
of xi are non-negative. If G is d-regular then we have xi = e := n~^'^(l, 1, . . . , 1) and Ai = d. In 
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this case it is easy to verify that |Aj| < d for all i, since if Xij has the largest absolute value among 
the coordinates of Xj then 



l^i^ij I 



liAx. 



^l]\ 



kaNij) 



< d\xij\. 



In this non-bipartite setting we write 



A = max I Aj I . 



We have the following pseudorandomness property for regular graphs (see, e.g., [3l[25]), whose short 
proof we include for the convenience of the reader. 



Lemma 5.2 Suppose G is a d-regular graph on n vertices. Then for any S,T CI V{G) we have 

d. 



e{S,T) - -\S\\T\ 
n 



<A^^:^. 



Proof. Let xs and xt denote the characteristic vectors of S and T, which are equal to 1 or in 
position V according as v belongs or does not belong to the corresponding set. Then e(5, T) = x^^Axt- 
Let {xi, a;2, . . . , Xn} be an orthonormal basis of eigenvectors, where Xi is the eigenvector corresponding 
to eigenvalue Aj, and write xs = Yll=i ^i^i and xt = Ya=i ^i^i- Then e{S,T) = Ya=i ^i^iU- Note 
that X^iLi ^1 = Xs ' Xs = \S\ and similarly X^iLi *f ~ 1^1- Thus we can estimate 



/ ^ ^iSiti 



i>l 



<A^|si||ti|<AV|S||r| 



i>l 



by the Cauchy-Schwarz inequality. Since \i = d, si = e ■ xs = 'n. ^'^I^l, and similarly ti = n ^'"^IT], 
we obtain 

e(5,r)--|5||r| <Xy^\S\\f\- □ 



n 



5.2 Pseudorandomness of regular bipartite graphs 

Now we adapt the arguments of the previous subsection to the bipartite setting. Let G be a bipartite 
graph on n vertices with parts X and Y. We choose the ordering vi,V2, ■ ■ ■ ,Vn of the vertices so 
that X precedes Y. When we consider vectors of length n this gives us a natural partition of its 
co-ordinates into two blocks corresponding to X and Y. Then the adjacency matrix A has block 
structure L^t q) where M is the bipartite incidence matrix of G, i.e. M has rows indexed by X, 
columns indexed by Y, and entries ruxy equal to 1 if xy is an edge, otherwise 0. First we consider the 
case when G is d-regular, which implies that \X\ = \Y\ = n/2. Then we have Ai = d, with eigenvector 
e := n~^''^{l, . . . , 1) as before, and An = —d, with eigenvector e := n~^''^{l, . . . , 1, —1, . . . , —1) having 
7i-i/2 [t^ [ig X-coordinates and — n~^'^ in its y-coordinates. In the bipartite setting we re-define A 

by 



A 



max I A j I . 



We have the following pseudorandomness property for regular bipartite graphs. 
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Lemma 5.3 Suppose G is a d-regular bipartite graph on n vertices with parts X and Y . Then for 
any 5 C X and T <^Y we have 



e{S,T)--\S\\T\ 
n 



< A 



\S\ + \T\ 



Proof. Let x = ixs, Xt) denote the characteristic vector of 5 U T. Then 

x'Ax = xWxT + XtM'xs = 2e{S, T). 
Writing x = Z^ILi ^*^* ^'^ ^^^ eigenvector basis we obtain 

n 

2e{S,T) = ^Xial 



i=l 



Since J27=i ^f = X ' X = I*?] + |T| we can estimate 






itti 



<|A|(|5| + |r|). 



We have 

Ai = d, ai = e-x = n-'^^^i\S\ + \T\), \n = -d and an = e ■ x = n-'^'^{\S\- \T\), 
so \ia\ + A„a^ = ^l/SHTl. This gives the stated estimate for e{S,T). 



D 



Remark. An alternative derivation of similar estimates may be obtained from the singular value 
decomposition of the bipartite incidence matrix. 

5.3 Nearly regular bipartite graphs 

We want to show pseudorandomness for 621 U {Cfc}-free graphs which are nearly extremal. Such 
graphs are not necessarily regular - for instance they may contain isolated vertices - so to treat them 



we will prove an analogue of Lemma 5.3 for nearly regular bipartite graphs. The quantity we use to 
measure irregularity of a graph is its variance. If G is a graph of average degree d with n vertices, 
then the variance of G is defined by 



Var(G) := - Y,id{v) -df = -Y, d{vf - d 



As before we set A = \{G) = maxj^i^„ |Ai| in the bipartite setting. We have the following pseudo- 
randomness property for nearly regular bipartite graphs (for a similar statement for non-bipartite 
graphs see ^5]). 

Lemma 5.4 Let /3, 7 G (0,1) and a = 4/3"*^' ^7^^ < 1/4. Suppose G = G{X,Y) is a bipartite graph 
on n vertices with average degree d and 
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r^; A(G)<(1-7H 
(ii) Var(G) < I3£. 

Then for any 5 C X and T (^Y we have 



e{S,T)--\S\\T\ 
n 



< {Aad + X/2)n. 



Recall that e = n^^'^(l, . . . , 1), and e := n~^'^(l, . . . , 1, —1, . . . , —1) has n~^/^ in its X-coordinates 
and — n~^' ^ in its y-coordinates. The following estimates will be used in the proof and later in the 
paper. 

Ai = max{x*ylx : ||3;|| = 1} > e^Ae = n^ > d{v) = d, and 

V 

Xn = m.m{x''Ax : \\x\\ = 1} < e'^Ae = n~^ V^ -d{y) = -d. 



Proof. Write |5| = s and \T\ = t. Without loss of generality s > t. As in Lemma 5.3, we consider 



the characteristic vector x = {xs^Xt) and write x = Sr=i^«^«' where {xi : 1 < z < n} is an 
orthonormal basis of eigenvectors and Xi is the eigenvector for Aj. Then Y17=i ^1 — X^ ' X = s + t 
2e{S,T) = x^Ax = X^"=i Xiof, and we estimate, with A = A(G): 



126(5*, r) - \ia\ - Xnal 






iOi 



< X{s + t) < Xn. 



(7) 



Next write e = XlILi '^*^* ^^"^ ^ ~ Z^iLi^*^*' where J2'^'i — Yl^f — ^'^ since we can replace any 
eigenvector Xi by —Xi we can assume that ci > and Cn > 0. We will show that xi is close to e and 
Xn is close to e. Consider z = Ae — de, which has co-ordinates Zi = n~^'^{d{vi) — d), and z = Ae + de, 
which has co-ordinates Zi = :izn~^''^{d{vi) — d), with positive sign when Vi £ Y and negative sign 
when Vi € X. Then by (ii), 



z ■ z = z ■ z = n 



-1 Y,idiv) - df = Var(G) < 0. 



We also have z = X](Aj — d)ciXi and z = X]('^i + d)ciXi, so 

z- z = z-z = ^q^(Aj -d)^ = y^^cfjXi + df. 
Then J2i^i c? < (^ - Xy^^d^ < ^-/-^ by (i), so ci > cf > 1 - Pj'^, and 

lie - xif = (1 - ci)2 + ^ c^ = (1 - cif + 1 - c? = 2(1 - ci) < 2/37 



-2 



Similarly, one can prove that c„ > 1 — /37 ^ and ||e — j;„|p < 2f3j ^. Next we estimate Ai and A„. 
Recall that Ai = ||Axi|| and d = \\de\\, and so by the triangle inequality, 



|Ai-d| 



\Ax-i 



\de\\\ < \\Axi — de\\ < 11^46 — (ie|| -|- ||Ae — Axi||. 



(8) 
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We noted before the proof that Ai > d, so we can write Ai = (1 + qi)d with qi > 0. Since 
\\Ae - de\\ = \\z\\ < p^/'^d and \\A{e - xi)\\ < Ai||e - xi|| < (2/3)V2^-iAi, from jsj we have qi < 
/3^/2 + (2/3)^/27^^(1 + qi). Thus, recalUng that a = 4/3^/^7"^ < 1/4, we can estimate 

^ /3V2 + (2/3)1/2^-1 ^ a/4 + a/{2V2) ^ 
^'- 1 - (2/3)1/2^-1 - i_ 1/(8^2) ' 

i.e. 

d< Xi < {l + a)d. 

Similarly, we have 

-d> Xn> -(1 + a)d. 

We also estimate ai = x " ^^i by x • e = n~^''^{s + t) and the inequality 

llx • (e - xi)f < llxf lie - xif < 2(3j-\s + t) < a^{s + t), 

which gives 

\ai - n-^/\s + t)\ <a(s + t)^/2. 

Similarly, we estimate a„ = x " 2;^ by x " e = n^^''^{s — t), and get 

\an-n-^/\s-t)\ <a(s + t)^/2. 

Now we have the necessary ingredients to estimate Aiaf + A^a^. Recall that Ai > d, Xn > —d{l + a) 
and s > t. Using the above estimates for ai, an, we have the lower bounds 

Xiaj > d(n-^/2(s + t) - a{s + t)^/^)^ and 
Xnal >-d{l + a)(n-^/2(s - t) + a{s + t)!/^)^. 

Thus we obtain 

Aia? + Anffl^ > —St -—is-tf- 2ad ( ) ( (s + t) + (1 + a)(s - t)) - a^d(s + t). 

n n \ n / \ ) 



Since s ^t <n and a < 1/2 we get 

X 2 , 2 4d 

Aiai + Xna„ st > —8adn. 

n 

The estimates for the upper bound are similar but slightly more technical. We use 

Xial < d{l + a)(n"^/2(s + t) + a{s + t)^/^)^, and 

2 f-(i(n-^/2(s-t)-a(s + t)i/2)2 if a(s + t)i/2 < n-i/2(s - t), 
^""- jo ifa(s + t)i/2>n-i/2(s-t). 

In the case q(s + t)i/2 < n" 1/^(5 — t) we have 

1 /9 

Aiai + Xnal < —st + —{s + tf + 2ad ( ) f (1 + a){s + t) + {s- t)) + a^d{s + t). 



n n \ n 
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In the case a{s + t)^^'^ > n^^/^(s — t) we have (s + t)^ — Ast = (s — t)^ < a'^n{s + t) < c?r? , so 

Aiaf + \na\ < Xiaj < d{l + a)(n"^(4st + a^n^) + 2an"^/2(s + i)^/^ + q^(s + t)). 
Smce s + i < n and a < 1/4, in both cases we obtain 

AiOi + Ana„ st < 8adn. 

n 

Combining this with ([T]) we obtain the stated estimate for e{S, T). 



D 



5.4 Proof of Theorem 15.11 

The idea of the proof is to use the connection between eigenvalues and closed walks. We can control 
the maximum degree by deleting few edges; then the main contribution to the upper bound on closed 
walks of length 2i + 2 in Lemma [j^ is {l/2 + o{l))n'^A^. This is very close to Af+^ + A2^+2 ^ 2^^^+^ 
so the other eigenvalues of this graph are small. 

We now give the details. Suppose e > 0. Consider any C2£-free bipartite graph H on n vertices 
with parts X and Y and average degree d ~ (n/2)^/^. Suppose S C X and T QY. By Lemma 



4.2 



there are o{e{H)) edges incident to vertices of degree at least (1 + e)d. We remove these edges to 
obtain a graph G of maximum degree A < (1 + £)d and average degree d ~ {n/2)^'^. Also, as in the 
proof of Lemma 



4.2 



3.10 



gives \X\ ~ |y| ~ ^. By Proposition 3.3 and Lemma 



Proposition 

1 " 

- E ^''^' = ^2V2(G) < (1/2 + o(l))nA2 + (4A)^+i 



3.4 



i=l 



Now we use the estimates Ai > d and A„ < —d. Recalling that A = maxj^i „ |Aj| we have 

A2^+2 < n^,+2(G) - 2(i2^+2 ^ (^/2 + o{l))n^A^ - 2d^'+^ + n(4A)^+i. 
Since d ~ {n/2)^/^ and A < (1 + e)d this gives \^^+^ < ((1 + ef -1 + o{l))2d^^+'^ . It follows that 

A<(6e)V(2^+2)d + o((i). 
Also, the variance is bounded as 



VAH(G) = ^■''■'<"' - ■"' = £^J^ - d < A . £^ 



n 



n 



n 



d< [l + e)d^ - d^ = ed^ . 



We now apply Lemma 5.4 with /3 = e and 7 = 1 — (6e)^'(^^^^^ + o(l). Recall that we need a 



4/3^/^7 ^ < 1/4 , which holds if e is small. Thus 

2d, 



eG{S,T) 



\T\ 



n 



< (4a(i + A/2)n, 



where a = 4/3"^' ^7 ^ < 5e^ " for small e. Recalling that G was obtained from H by deleting o{n^^^' ) 
edges we have 

D 



eH{S,T)--\S\\T\ 
n 



< 



Since e is arbitrary this proves Theorem 5.1 
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6 Proof of Theorem 11.1 



Suppose that G is a €21 U {Cfc}-free graph with n vertices with average degree d = @{n^'^), where 
A; > 2^ is odd. For the first part of Theorem |1.1[ we are required to find a bipartite graph H C G 
such that e{H) > d^^ — o{n^^^' ). Similarly to the sketch given for 4-cycles, the idea is that we can 
take H to be the bipartite subgraph spanned by Ne{v) and N^j^-i^v) for a suitable vertex v. The first 
step is to pass to a subgraph with low maximum degree. 

Let A = n-*^'^"*"^, where c := 1/21"^. Let S be the set of vertices of degree more than A, and let 
Go be the graph obtained by removing all edges of G containing at least one vertex of S, where Gq 
has average degree dg. We will show that do ~ d. To estimate the number of edges removed, recall 
from Q that ex{n,C2e) < ^n^+^Z^ + 0{n), so 

\S\ < -^ <m:= 2n^ ". 



It follows that 



e{S) < -m^+i/^ + 0{m) < n^+V^'^ 



Also Proposition 3.10 gives 

e{S, V{G) \ S) < (mn)i/2+i/2^ + n < n'+'/'-'/^. 

In particular, we have e(Go) > e(G) — o(n^+^'^), and therefore do ~ d. For the remainder of the 
proof we work in the graph Gq, which has maximum degree at most A. 

Next, by Lemma 3.7, we can choose a vertex v that is the start of at least d^^ — i'^A^ = 



d^~^^ + o{n^~^^'^) paths of length £ + 1 in Gq. We claim that all but o{n^^^'^) of these paths reach 
N£^i{v). Consider a breadth-first search tree T rooted at v. Consider any path P of length i + 1 
that does not reach Ng^i{v). Then there is a smallest i such that the {i + l)st edge of P does not 
go from Ni{v) to Ni^i(v). By construction of T this edge must either go from Ni{v) to Ni^i{v) or 
lie within Ni{v). The first case is impossible, as any edge of E{Gq) \ E(T) between Ni-i{v) and 



Ni{v) would create an even cycle of length at most 2i. For the second case we recall that Lemma 3.8 
implies that G[Ni{v)] has average degree at most 2k for any i < i, since k > 2i. Also note that from 
maximum degree assumption we have that |A'j(f)| < A*. This gives at most k\Ni{v)\ < A;A* choices 
for the (i + l)st edge of P. Let w be the first vertex of P in Ni{v). The subpath of P from v from w 
is uniquely determined (otherwise we would have an even cycle of length at most 2i) . Then we have 
at most A choices for each of the i — i subsequent edges of P. In total, the number of choices for P 
is at most kA^ = o{n^~^^'^), as required. 

Each edge between N£{v) and N£^i{v) is contained in at most one path of length I + 1 from v 
to N£^i{v), otherwise we would have an even cycle of length at most 2i. Thus, taking H to be the 
bipartite graph of edges between Ni{v) and Nij^i{v), we have e{H) = d^^"^ + o{n^^^/^)^ as required. 

To prove the second part of Theorem IliI suppose that d > (1 + o(l))(n/2)^' . The number of 
edges in the bipartite graph H constructed above is at least (l + o(l))n(i^"*"^/2 > (l + o(l))(n/2)^+^/^. 
On the other hand, from Proposition |3.9| we have e{H) < (1 + o(l))(n/2)^+^' . Therefore e{H) ~ 



(n/2)i+i/^ and d < (1 + o(l))(n/2)i/^ Since d > (1 + o(l))(n/2)V^, we also have d ~ {n/2)^/^. So 



e{H) ~ e(G) and by Proposition 3.9 this shows e(G) ~ z(n,C2^). This completes the proof. D 
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7 Proof of Theorem 11.2 



Suppose that n is large, and G is a graph on n vertices with e{G) > {q + l)n/2, where q G M'^ is 
defined by n = 2{q^ + q^~^ + ••• + !). Suppose also that G does not contain an even cycle of length 
at most 2i, or Ck for some odd k > 21 + 2. We will show that e{G) = {q + l)n/2 and G is bipartite. 



Then Proposition 3.9 characterizes equality, namely, G must be the incidence graph of a generalized 
polygon. We start by considering the case I > 2 and k > 2i + 2. 

Case 1 : ^ > 2 and k > 2i + 2. Let H he a bipartite subgraph of G with maximum size. We 
show G = H. By Theorem IliI e{H) ~ e(G) ~ (n/2)^^^/ . Furthermore, maximality of H implies 
that d{H) > 5{G)/2, as if there were a vertex of degree less than 5{G)/2 in H we could move it to 
the other part and increase the number of edges in H. By Section 4.3, we can assume 5{G) > q/A, so 
5{H) > q/8. Label the parts of i? as Xq and Xi, with XqUXi = V{G). Suppose for a contradiction 
that G[Xo] contains an edge {x,y}. Let z be a neighbor of y in Xi. We greedily construct a sequence 
of mutually disjoint sets {y} and SI and 5* for < i < £, where S^ = {x}, S^ = {z}, 

Si C N{Si-^) n Xi ^od 2 and SI c iv(5r') n Xi+i „,od 2. 

Note that by definition we have S^. C Ni{x) and SI Q Ni{z). By consideration of breadth first search 
trees as in the proof of Theorem |1.1[ every vertex in Ni{x) has exactly one neighbor in Ni-i{x) and 
e{Ni{x)) < A;|A'^j(x)| for every 1 < i < i — 1. Moreover, two distinct vertices of Ni(x) can not have 
a common neighbor in Ni^i{x). Similar statements hold for z. By the minimum degree assumption 
in H^ we have \Ni{x)\, |A^j(^;)| > {q/8y for all < i < ^. Since q ~ (n/2)^, this allows us greedily to 
choose disjoint sets 5* , SI so that 

\^x\ \^z\ y-"') 1 



for all 1 < z < ^ and some constant c > 0. Now we apply Theorem 5.1 to H with S = S^ and T = S^. 
This gives 

_ ^ (l + o(l))2(,./2)V' _ , ^ ^^^ 

n 
In particular, we can find a path of length k — 2£ — 2 using only edges between S and T. By 
construction this can be completed to a cycle of length k in G, which is a contradiction. We deduce 
that Xq is an independent set in G. Similarly Xi is independent, so G = H. This completes the 
proof for k > 2i + 2. 

For the remainder of the proof we consider the special case 1 = 2 and k = 5. 

Case 2:1 = 2 and k = 5. We use a similar vertex deletion argument to that in Section 4.3 to 
find a subgraph Gt of G with t > [O.OlnJ vertices and S{Gt) > O.blt^'^. Starting with G = Gn, we 
produce a graph Gi with i vertices by deleting a vertex of Gj+i of degree at most 0.51(z + 1)^'^ for 
each i < n. After t steps, the total number of edges deleted is less than 

rn+l 



V 0.51ii/2 < 0.51 r xi/2 dx = —{{n + 1)3/2 _ (^ ^ i)3/2)_ 
i=t+i -^t+i ^ 

Suppose that we fail to find a subgraph of minimum degree more than 0.51t^" in n — i = [0.99n] 
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steps. Then we have a graph Gt with [O.OlnJ vertices and 
e(G,) > (, + l)n/2 - ^((n + 1)=^/^ - (t + 1)^/^) > ^n'/' " ^((" + ^f" " (* + 1)'^') 

> f 1 _ 1:2^ _ 0(1)^/' > 0.01n3/2 > -^Q^3/2_ 

~ V2V2 3 ' V 

This contradicts Theorem |1.1[ provided t = [O.OlnJ is large enough. So G has a subgraph Gt with 
t vertices and S{Gt) > 0.51t^ " and where t > [O.OlnJ . For i < n let qi be the unique positive real 
defined by i = 2{qf + Qj + 1), so that q^ = q. Then qi = {yj2i — 3 — l)/2, and if t < n then 

e(Gn-i) > e(G) - O.Sln^/^ > (^ + l)n/2 - O.Sln^/^ > (g^__^ + l)(n - l)/2 

for large enough n, using 

{q + l)n/2 - (g„_i + l)(n - l)/2 = (V2n - 3 - l)n/4 - (V2n - 5 - l)(n - l)/4 
= V2n - 5/4 + (V2n - 3 - V2n - 5)n/4 - 1/4 ~ —=n^l'^ > O.Sln^/^. 

Repeating this calculation, we get that 

if t < n then e{Gt) > {qt + l)t/2, (9) 

provided t > [O.OlnJ is large enough. We will show that Gt is bipartite. 

First we pass to a maximum bipartite subgraph H of Gt as in Case 1, with parts Xq and Xi. 
By Theorem 1.1, e{H) ~ e{Gt) ~ (t/2)^/^ . We claim that no vertex x ^ Xq has more than 0.09i^/^ 



neighbors in Xq, and similarly for Xi. To see this, note that such an x G Xq also has more than 
0.09t^" neighbors in Xi by maximality of H. Then N2{x) contains 0(n) vertices of Xq and 0(n) 
vertices of Xi. Choose a set S of 0(n) vertices of N2{x) n Xq and a set T of 0(n) vertices of 
N2{x) n Xi such that for each w £ S and z €z T, there exist paths of length two from x to w and 



from X to z which share only the vertex x. Then in H we apply Theorem 5.1 (pseudorandomness) 
to conclude 

e(S,T)>(l±«M!|<;||T|-„(tn 

In particular, e{S, T) 7^ and evidently there is a cycle of length five through x and any edge between 
5 and T, a contradiction. Therefore no vertex has more than 0.09t^'^ vertices in its own part. It 
follows that every vertex has degree at least 0.501f^''^ in H. 



We next claim that |Xo| ~ \Xi\ ~ t/2. First note from Proposition 3.10 that 
e{H) < z(t, Q) < (|Xo||Xi|)3/4 + max{|Xo|, |Xi|}. 
On the other hand, e{H) ~ e(G) ~ (t/2)'^/^, and so we see 

(|Xo||Xi|)'/'>(l + o(l))(t/2)3/2. 
Since |Xo| + |Xi| = t, and we just observed |Xo||Xi| > (l+o(l))(t/2)^, we conclude |Xo| ~ \Xi\ ~ t/2. 
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Now we show Gi[Xo] and Gi[Xi] have no edges, so that Gt is bipartite. Suppose for a contradiction 
that Gt has an edge {x^y} with x,y G Xq. Note that x and y have at most one common neighbour, 
since G is C4-free. Let z be this common neighbour if it exists, or an arbitrary vertex otherwise. Let 
S be the set of ends of paths of length two in H that start at x and avoid {y, z}. Let T be the set 
of ends of paths of length two in H that start at y and avoid {x, z}. Since H has minimum degree 
more than O.SOlt^'^, each of S and T have size at least (0.501t^'^ — 1)^ > |Xo|/2, provided t is large 
enough. Then since S,T C Xq there is a vertex w G SCiT. Thus we have paths xaw and ybw, where 
a ^ h since our paths avoid z. However, xawby forms a 5-cycle, so we have a contradiction. We 
conclude Gi[Xo] is empty, and similarly, Gt[Xi] is empty, so Gt is bipartite. 

To complete the proof, recall that e{Gt) < [qt + l)t/2 by Proposition 3.9 However by ([9]), 



e(G) > {qt + l)i/2 for t < n. Thus we must have t = n and e{Gt) = e(G) = {q + l)n/2, so 



Gt = G. Therefore G itself is bipartite, and by Proposition 3.9 G is the bipartite incidence graph of 



a projective plane. D 



8 Proof of Theorem 1.3 



Let £ > 2 and k > M + 1 he odd and c > 0. Suppose that G is a C21 U {Gfc}-free graph on n 
vertices with minimum degree at least cn}'^ . We need to show that x{G) < {AkY^^/c^. We use the 
approach of Thomassen [37j . Consider a maximal sequence of vertices vi , t;2 , • . . , "Ug such that the Ith 
neighborhoods Ng{vi) are pairwise disjoint. For r > write N<riv) = Nq{v) L) Ni{v) U ■ ■ ■ L) Nr{v). 
By Lemma |3.8[ since A: > 2^ + 1, for any v G V{G) and r < £ we have e{Nr{v)) < k\Nr{v)\. Since G 
is C2f-free, no vertex of Nr{v) has more than one neighbor in Nr-i{v), so 

eiN<riv)) <ik + l)\N<riv)\ < 2k\N<riv)\. 

On the other hand, since G has minimum degree at least en}' , 

e{N<r{v)) > '^cn^/'\N<r-i{v)\. 

We conclude that \N^r{v)\ > jj:cn^'^\N^r-i{v)\ for all r < £, which implies 

\N<dv)\ > ppn 

for every vertex v £ V{G). In particular, this holds for ui, W2) • • . , fs, so s < (Ak) /c . The maximality 
of s implies that any vertex v is within distance 2i of some Vi. Now we use the assumption that 



k > Ai + 1. The proof of Lemma 3.8 shows that for r < 2£, every subset of Nr{v) induces a 
subgraph of average degree less than 2k. It follows that G[Nr{v)] has chromatic number at most 2k. 
Furthermore, G[Ni{v) U Ns{v) U • • • U iV2^-i(f)] has chromatic number at most 2k, since there are 
no edges between Ni{v) and iVj+2(^') for any i, and similarly G[No{v) L) N2{v) U ■ ■ ■ U N2e{v)] also has 
chromatic number at most 2k. Therefore G[N<2e{v)] has chromatic number at most 4fc, so we can 
cover N<2t{vi) by at most Ak independent sets for 1 < i < s. It follows that 

X{G) < Aks < ^^. 
This completes the proof of Theorem |1.3[ D 

24 



9 Concluding remarks 

• Our stability approach not only gives extremal results but describes the approximate structure of 
nearly extremal graphs. We only needed these results in the bipartite (Zarankiewicz) setting, but 
we note that very similar arguments give analogous results in the non-bipartite (Turan) setting. For 
example, we have the following result. Suppose G is a C4-free graph on n vertices with average 
degree d ~ y/n. Then for any S,T Q V{G) we have e{S,T) = -|5||T| + o{n^''^). The proof is very 



similar to that of Theorem 5.1 First we control the maximum degree as A < (1 + e)d by deleting 



0{n) = o{n^''^) edges. Then the argument of Lemma 3.4 shows that w'^{G) < (1 + o(l))nA^; the 
only difference is that there are n — 1 choices for u rather than n/2 + o(n). On the other hand, 
""^eC^) ~ ^ S ^f ^^^ ^ contribution of d^/n ~ n? from the first eigenvalue, so the other eigenvalues 
are o{d) as e — )• 0. The pseudorandomness property now follows from the non-bipartite version of 



Lemma 5.4 which is given in j25t Section 2.4]. 



• No result similar to Lemma 4.1 can hold for Cg-free graphs: in fact by the results of |20j . there 
exist 5,e > such that any extremal Cg-free graph G with average degree d has at least Se{G) edges 
containing a vertex of degree more than (1 + e)d. 



• We proved the even girth result Theorem 1.2 under the assumption that the forbidden odd cycle 
length satisfies k > 21 + 3. The polarity graphs show that no such result holds for 3 < k < £, but 
some values of k remain open. In particular, one might think that the case k = 2i + 1 should be 
approachable by the methods we used to handle {C4, Csj-free graphs. However, the vertex deletion 
method does not give sufficient minimum degree for a straightforward adaptation of this argument, 
so other ideas are needed. 

• The polarity graphs have arbitrarily large chromatic numbers, as shown by estimates on their 



independence numbers by Godsil and Newman [22j. Thus Theorem 1.3 does not hold when {k,i) G 
{(3, 2), (3, 3), (3, 5), (5, 5)}, but it seems likely that it should hold when k > 2i + 1. We also remark 
that the bound {4kY~^^/c^ is unlikely to be the correct dependence of x{G) on c, and it would be 
interesting to determine this even for i = 2. 

• We remarked earlier that there is little known about the approximate structure of nearly extremal 
graphs. In the case of 4-cycles, Fiiredi [17^ [T8] showed that any C4-free graph on q'^ + q + 1 vertices 
has at most 2 '7(9+ 1)^ edges, and for large q equality can only hold for polarity graphs. However, we 
do not know whether all nearly extremal graphs are structurally close to polarity graphs. Such an 
understanding would probably have implications for the conjecture of Erdos that ex(n, {C3, C4}) ~ 
z(n, C4), and for other seemingly more basic questions, such as whether a graph on n vertices with 
at least ex(n, C4) + 1 edges must contain many 4-cycles. 

• We also considered more generally the problem of determining bipartite graphs F and odd integers 
k for which ex{n,{F,Ck}) ~ z{n,F) and developed different methods to attack this problem when 
F is not an even cycle. This will be the subject of a second paper, co-authored with P. Allen. 
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